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IN THE CLAIMS; 

Please amend the claims as indicated below. Any difference between a claim not marked 
"Currently Amended" below and the previous state of that claim is unintentional and in the nature of 
a clerical or typographical error. 

1 . (Currently Amended) An apparatus for executing at least one single program multiple 
data (SPMD) program, said apparatus comprising: 

a micro single instruction multiple data (SIMD) unit associated with a microprocessor; and 
a job buffer having an output coupled to an input of said micro SIMD unit, 
wherein said job buffer is configured to: 

at runtime, compare a job status of a plurality of jobs; 

dynamically bundle a subset of said plurality of jobs into a task based on an 
equivalence of a job status of said jobs on said comparison ; and 

allocate said task to said micro SIMD unit, and 
wherein said job status comprises a program counter value and a loop-counter list, and 
wherein a job is a combination of a program and an input data-set. 

2. (Currently Amended) The apparatus as set forth in Claim 1 wherein said micro SIMD 
unit is configured to send capable of sending job status information to said job buffer. 

3. (Original) The apparatus as set forth in Claim 1 wherein said at least one SPMD 
program comprises a plurality of input data streams having moderate diversification of control flows. 



-2- 



Docket No. 03-LJ-064 (STMIOl-03064) 
Serial No. 10/714,179 
Patent 

4. (Previously Presented) The apparatus as set forth in Claim 3 wherein said apparatus 
is configured to execute said at least one SPMD program once for each input data stream of said 
plurality of input data streams. 

5. (Previously Presented) The apparatus as set forth in Claim 4 wherein said apparatus 
is configured to generate an instruction stream for each input data stream of said plurality of input 
data streams. 

6. (Previously Presented) The apparatus as set forth in Claim 3 wherein said apparatus 
is configured to execute a plurality of SPMD programs and wherein each SPMD program of said 
plurality of SPMD programs is executed on a number of input data streams. 

7. (Original) The apparatus as set forth in Claim 6 wherein said number of input data 
streams is greater than a program granularity threshold. 

8. (Canceled). 

9. (Previously Presented) The apparatus as set forth in Claim 1 wherein said apparatus 
is configured to perform job clustering to form a job bundle in which each job in said job bundle has 
an equivalent control flow. 
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10. (Previously Presented) The apparatus as set forth in Claim 9 wherein said apparatus 
is configured to perform said job clustering based on a job processing status of said jobs in said job 
bundle. 

1 1 . (Previously Presented) The apparatus as set forth in Claim 1 wherein said apparatus 
is configured to force a task to terminate at a point where a job control path might fork by placing a 
code-stop in said task. 

12. (Previously Presented) The apparatus as set forth in Claim 11 wherein said apparatus 
is configured to minimize a required number of code-stops to be placed in said task by excluding 
from code-stop placement each control flow statements that is equivalent to a select instruction. 

13. (Previously Presented) The apparatus as set forth in Claim 9 wherein said apparatus 
' is configured to maximize a size of a job cluster by selecting tasks for execution in which a job 

processing status of each of said tasks is complete. 

14. (Previously Presented) The apparatus as set forth in Claim 1 wherein said apparatus 
is configured to execute a data loading phase for a task before said apparatus executes a task 
execution phase for said task. 
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15. (Currently Amended) A method for executing at least one single program multiple 
data (SPMD) program, for use with a micro single instruction multiple data (SIMD) unit associated 
with a microprocessor, and a job buffer having an output coupled to an input of said micro SIMD 
unit, said method comprisin g the steps of : 

at runtime, comparing a job status of a plurality of jobs; 

dynamically bundling a subset of said plurality of jobs into a task based on an equivalence of 
a job status of said jobs; and 

in a job buffer, allocating said task to a micro single instruction multiple data (SIMD) unit 
said micro SIMD unit in said job buffe r , wherein said micro SIMD unit is associated with a 
microprocessor, and said job buffer has an output coupled to an input of said micro SIMD unit , 

wherein said job status comprises a program counter value and a loop-counter list, and 
wherein a job is a combination of a program and an input data-set. 

1 6 . (Currently Amended) The method as set forth in Claim 15 further comprising-the^t^ 

ef: 

sending job status information from said SIMD unit to said job buffer. 

17. (Original) The method as set forth in Claim 15 wherein said at least one SPMD 
program comprises a plurality of input data streams having moderate diversification of control flows. 
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18. (Currently Amended) The method as set forth in Claim 17 further comprising4fee-step 

ef: 

executing said at least one SPMD program once for each input data stream of said plurality of 
input data streams. 

1 9 . (Currently Amended) The method as set forth in Claim 1 8 further comprising-the^tep 

ef: 

generating an instruction stream for each input data stream of said plurality of input data 
streams. 

20. (Currently Amended) The method as set forth in Claim 17 further comprising-the 
steps of : 

executing a plurality of SPMD programs; and 

executing each SPMD program of said plurality of SPMD programs on a number of input 
data streams. 

21. (Original) The method as set forth in Claim 20 wherein said number of input data 
streams is greater than a program granularity threshold. 

22. (Canceled). 
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23. (Currently Amended) The method as set forth in Claim 15 further comprising-the 

step of : 

performing job clustering to form a job bundle in which each job in said job bundle has an 
equivalent control flow. 

24. (Currently Amended) The method as set forth in Claim 23 further comprising-the 

step of : 

performing said job clustering based on a job processing status of said jobs in said job 

bundle. 

25. (Currently Amended) The method as set forth in Claim 15 further comprising-the 

step of ; 

forcing a task to terminate at a point where a job control path might fork by placing a code- 
stop in said task. 

26. (Currently Amended) «The method as set forth in Claim 25 further comprising-the 

step of : 

minimizing a required number of code-stops to be placed in said task by excluding from 
code-stop placement each control flow statements that is equivalent to a select instruction. 
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27. (Currently Amended) The method as set forth in Claim 23 further comprising-the 

step of : 

maximizing a size of a job cluster by selecting tasks for execution in which a job processing 
status of each of said tasks is complete, 

28. (Currently Amended) The method as set forth in Claim 15 further comprising-the 

step of : 

executing a data loading phase for a task before executing a task execution phase for said 

task. 
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